Large-scale Learning of Sign Language by Watching TV (Using Co-occurrences)

Authors

  • Tomas Pfister
  • James Charles
  • Andrew Zisserman
Abstract

We present a framework that automatically and quickly learns a large number of signs from sign language-interpreted TV broadcasts by exploiting supervisory information available in the subtitles. Our contributions are: (i) we show that, somewhat counter-intuitively, mouth patterns are highly informative for distinguishing words in a language for the Deaf, and their co-occurrence with signing can be used to significantly reduce the correspondence search space; and (ii) we develop a multiple instance learning method using an efficient discriminative search, which determines a candidate list for the sign with both high recall and precision. The previous approach of Buehler et al. [2] for learning signs relies on complex features and a computationally expensive, application-specific learning framework. This has hindered the large scale application of this method. In this paper we describe a method that is much simpler and computationally lighter.

Motivation. TV programmes in many countries across the world are now routinely broadcast with both subtitles and an overlaid signer translating to the Deaf audience (Fig. 2). Our aim is to use this material to learn signs corresponding to English words in the subtitles [2, 3]. We use this continuous and rich source of training material to build a database of word-sign pairs for a large number of signs and signers. The vision is that this database can later be used to train a large-scale person-independent sign language to text translator.

Summary of method. We cast the problem as one of Multiple Instance Learning (MIL), where the training data are visual descriptors (hand trajectories) with weak supervision from subtitles. We proceed in three steps: (i) the search space for correspondences is significantly reduced by exploiting lip and hand motion co-occurrences to filter away irrelevant intervals of the temporal sequences; (ii) candidates for the signs are obtained using an efficient discriminative search over all remaining sequences by casting each candidate as a classifier for the positive and negative sequences; and finally (iii) these candidates are then selected or rejected using the MIL support vector machine framework (MI-SVM) [1]. Fig. 1 illustrates the processing steps and Fig. 3 shows example results.
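Step (i) of the method summary can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes that each video frame has already been reduced to two scalar activity scores, one for mouth motion and one for hand motion, and it keeps only the frame intervals where both scores exceed a threshold. The function name `cooccurrence_intervals`, the thresholds, and the minimum interval length `min_len` are all hypothetical choices made for illustration.

```python
from typing import List, Tuple


def cooccurrence_intervals(
    mouth_motion: List[float],
    hand_motion: List[float],
    mouth_thresh: float = 0.5,   # hypothetical threshold, not from the paper
    hand_thresh: float = 0.5,    # hypothetical threshold, not from the paper
    min_len: int = 3,            # hypothetical minimum interval length in frames
) -> List[Tuple[int, int]]:
    """Return half-open frame intervals [start, end) where both signals are active."""
    if len(mouth_motion) != len(hand_motion):
        raise ValueError("signals must have the same number of frames")

    intervals: List[Tuple[int, int]] = []
    start = None  # index of the first frame of the current active run, if any
    for i, (m, h) in enumerate(zip(mouth_motion, hand_motion)):
        active = m >= mouth_thresh and h >= hand_thresh
        if active and start is None:
            start = i
        elif not active and start is not None:
            # The run ended at frame i; keep it only if it is long enough.
            if i - start >= min_len:
                intervals.append((start, i))
            start = None

    # Close a run that extends to the final frame.
    if start is not None and len(mouth_motion) - start >= min_len:
        intervals.append((start, len(mouth_motion)))
    return intervals


if __name__ == "__main__":
    mouth = [0, 0, 1, 1, 1, 1, 0, 1, 1, 0]
    hand = [0, 1, 1, 1, 1, 0, 0, 1, 1, 1]
    print(cooccurrence_intervals(mouth, hand))
```

Only the surviving intervals would then be passed on to the candidate search in step (ii), which is what shrinks the correspondence search space.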


Similar resources

THE EFFECT OF STANDARD AND REVERSED SUBTITLING VERSUS NO SUBTITLING MODE ON L2 VOCABULARY LEARNING

Audiovisual material accompanied by interlingual subtitles is a powerful pedagogical tool which can help improve the vocabulary learning of second-language learners. This study was intended to determine whether or not the mode (standard and reversed) of subtitling affects the incidental vocabulary acquisition of Iranian L2 learners while watching TV programs. Forty-five participants were random...


Gender, Social Class and Peer Group Influences on Teenagers’ TV Involvement in a Developing Country

The aim of this paper is to develop and confirm a multi-item measurement scale for peer group influences on teenagers’ TV involvement in a developing country and report on the role teenager’s gender and social class background plays in this regard. Various researchers have proposed reinforcement, modelling, motivation, co-viewing and mediation as the domain items for peer group influence on tee...


Does Weather Matter?: Causal Analysis of TV Logs

Weather affects our mood and behaviors, and many aspects of our life. When it is sunny, most people become happier; but when it rains, some people get depressed. Despite this evidence and the abundance of data, weather has mostly been overlooked in the machine learning and data science research. This work presents a causal analysis of how weather affects TV watching patterns. We show that some ...


Time dependency in TV viewer clustering

Web-based catch-up TV services allow users to watch programs at their favoured time and device and are revolutionizing the existing TV watching habits. With the increasing offer and demand for catch-up TV, it has become evident that there is a need for personalised recommendations that will help users to pick programs of interest from a large collection of available content. In order to mitigat...


Relationship between Television Viewing and Language Delay in Toddlers: Evidence from a Korea National Cross-Sectional Survey

PURPOSE: This study investigated the relationship between 2-year-old children's exposure to TV and language delay. METHODS: The subjects of this study were 1,778 toddlers (906 males and 872 females) who participated in the Panel Study on Korean Children conducted in 2010. The linguistic ability of the toddlers was measured with the K-ASQ (Korean-Ages and Stages Questionnaire). The relationship ...



Publication date: 2013